Predicting Intonational Boundaries Automatically from Text: The ATIS Domain
نویسندگان
چکیده
Rela t ing the intonat ional characteristics of an u t ter ance to other features inferable f rom its text is impor t an t bo th for speech recognition and for speech synthesis. This work investigates techniques for predic t ing the locat ion of intonat ional phrase boundaries in na tu ra l speech, th rough analyzing a ut terances from the D A R P A Air Travel In format ion Service database. For s ta t is t ical model ing, we employ Classification and Regression Tree ( C A R T ) techniques. We achieve success rates o f jus t
منابع مشابه
Predicting Intonational Phrasing from Text
Determining the relationship between the intonational characteristics of an utterance and other features inferable from its text is important both for speech recognition and for speech synthesis. This work investigates the use of text analysis in predicting the location of intonational phrase boundaries in natural speech, through analyzing 298 utterances from the DARPA Air Travel Information Se...
متن کاملAutomatic Classi cation of Intonational Phrase Boundaries
The relationship between the intonational characteristics of an utterance and other features inferable from its text represents an important source of information both for speech recognition, to constrain the set of allowable hypotheses, and for speech synthesis, to assign intonational features appropriately from text. This work investigates the usefulness of a number of textual features and ad...
متن کاملThe ATIS Sign Language Corpus
Systems that automatically process sign language rely on appropriate data. We therefore present the ATIS sign language corpus that is based on the domain of air travel information. It is available for five languages, English, German, Irish sign language, German sign language and South African sign language. The corpus can be used for different tasks like automatic statistical translation and au...
متن کاملTraining intonational phrasing rules automatically for English and Spanish text-to-speech
We describe a procedure for acquiring intonational phrasing rules for text-to-speech synthesis automatically, from annotated text, and some evaluation of this procedure for English and Spanish. The procedure employs decision trees generated automatically, using Classi cation and Regression Tree techniques, from text corpora which have been hand-labeled by native speakers with likely locations o...
متن کاملارائه مدلی برای استخراج اطلاعات از مستندات متنی، مبتنی بر متنکاوی در حوزه یادگیری الکترونیکی
As computer networks become the backbones of science and economy, enormous quantities documents become available. So, for extracting useful information from textual data, text mining techniques have been used. Text Mining has become an important research area that discoveries unknown information, facts or new hypotheses by automatically extracting information from different written documents. T...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1991